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Abstract: The detection of primordial non-Gaussianity could provide a powerful means 
to test various inflationary scenarios. Although scale-invariant non-Gaussianity (often de- 
scribed by the /jvx formalism) is currently best constrained by the CMB, single-field models 
with changing sound speed can have strongly scale-dependent non-Gaussianity. Such mod- 
els could evade the CMB constraints but still have important effects at scales responsible for 
the formation of cosmological objects such as clusters and galaxies. We compute the effect 
of scale-dependent primordial non-Gaussianity on cluster number counts as a function of 
redshift, using a simple ansatz to model scale-dependent features. We forecast constraints 
on these models achievable with forthcoming data sets. We also examine consequences 
for the galaxy bispectrum. Our results are relevant for the Dirac-Born-Infeld model of 
brane inflation, where the scale-dependence of the non-Gaussianity is directly related to 
the geometry of the extra dimensions. 
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1. Introduction 

Recent Cosmic Microwave Background (CMB) measurements e.g. |j]] have tightened con- 
straints on cosmological parameters, verifying the inflationary predictions of a flat universe, 
structure formation from primordial adiabatic super-horizon perturbations and nearly scale 
invariant perturbations with a slightly red spectrum, and have started ruling out specific 



inflationary models (e.g. X(p 4 model). However, the observables measured so far have lim- 
ited power to distinguish between scenarios, as the efforts to reconstruct the inflationary 
potential demonstrate e.g. [^, ||, ||]. There are at least two possible observables, acces- 
sible in the near future, that have the potential to rule out or support large classes of 
models: non-Gaussianity and primordial gravitational waves. Here we focus on the first 
possibility, although both may be related in an interesting way which we review briefly 
in § |2.2| . Non-Gaussianity is particularly valuable because it probes details of the inflaton 
self-interactions during inflation and can distinguish properties of the inflaton Lagrangian 
that cannot be constrained by the power spectrum alone. In this work, we would like to 
emphasize observable and scale-dependent non-Gaussianity as a signature of (so far) non- 
standard inflationary physics and demonstrate that near-future observations on a range of 
scales can provide important constraints for scale-dependent scenarios. 

The simplest single field, slow-roll inflation predicts nearly Gaussian initial fluctuations 
H |6|, 0, ||, ||, [H], 11, 12 1 where the deviation from Gaussianity is unobservable [b|, 14]. 



Multi-field models may or may not produce interesting non-Gaussianity, depending on 
the model ]|15[|. For a review of possibilities known through 2004, see for example | p.6| [. 
However, even single-field inflation may generate significant non-Gaussianity if the inflaton 
has a non-trivial kinetic term 1 . Mukhanov and others 1 15 , |H| have investigated the case of 



a Lagrangian which is a general function of the inflaton and powers of its first derivative. 
These models are characterized by a sound speed c s different from 1 during inflation, 
which changes the time of horizon exit for scalar modes (c s k = aH) but not for tensor 
modes. (Here c s is dimensionless, normalized by the speed of light c = 1.) The primordial 



bispectrum of these scenarios has been worked out in detail [20, 21], with the important 
result that higher-order derivative terms introduce new and dominant contributions to the 
three point function. For small sound speed, these terms can generate non-Gaussianity 
that is observably large. In addition, the sound speed may change during inflation, leading 
to scale-dependent non-Gaussianity. 

A form of non-Gaussianity of initial conditions that is widely used in the literature is 



IH, [23 



C(x) = ( G (x) + lf NL [<* (x) - <Cg(x)>] (1.1) 

where C( x ) is the primordial curvature perturbation, Cg( x ) is a Gaussian random field and 
the degree of non-Gaussianity is parameterized by (constant) Jnl- Here a positive Jnl 
leads to a positive skewness in the density perturbations 2 . For non-Gaussianity of this type, 
CMB data are expected to yield the strongest constraints on f^L |^3|i ^6|- Current CMB 
data (WMAP3) already constrain —36 < f^L < 100 |27| and could potentially achieve 
\f NL \ ~ few ||, HI f§. 



1 Single field models which temporarily violate the slow- roll conditions may also give significant non- 
Gaussianity. The case of a step in the potential is analyzed in Il7| and is easily distinguishable from the 
scenarios discussed here. 

2 There are several different conventions in the literature for defining /jvx through an equation like 
Eq.([l.l|), Here we use the same convention as WMAP |25|, but see Appendix A..2 for a discussion and 
details of some of the conventions used by previous authors. 



The three-point function of the Fourier transform of Eq. (|l.l| ) has a particular de- 
pendence on the three momenta (k\, foj £3) that has been dubbed the local shape pO|j , 
Scenarios where non-Gaussianity is generated outside the horizon (including the curvaton 
scenario pl| and the variable decay width model p2[ ) have this shape. A characteristic 
of the local shape is that the magnitude of the three-point function is largest when one of 
the momenta is much smaller than the others (the squeezed limit). For scenarios differ- 
ent from the local case, an effective parameter fff/ may be defined from the magnitude 
of the three-point function evaluated at k\ = k-i = ks (the equilateral limit). This has 
been used to indicate the amount of non-Gaussianity in various models and is the variable 
CMB experiments report constraints on fl2q|. However, possibly a better single number to 



compare between models is the skewness (defined in §4.1), which integrates over all shape 
configurations of the three-point function in /c-space. 

The dominant contribution in single field models with higher derivative terms has 
a very different shape in Fourier space: it is largest when all momenta are equal. A 
momentum-dependent estimator which has its maximum in the equilateral limit and can 
be efficiently compared to data has been used to constrain the magnitude of equilateral type 
non-Gaussianity from the CMB [^, ^7| . We will call the A;-space configuration dependence 
of the estimator the equilateral shape and the corresponding effective parameter f^ L - 
From the WMAP three year data, Creminelli et al. find -256 < f$ L < 332 [{27]]. The 
equilateral shape differs by about 20% from the more general sound speed case, as we 
review below. More importantly, the non-Gaussianity can be scale-dependent, and in this 
case observational constraints assuming scale- independence should be revisited 3 . In this 
paper, we will concentrate on non-Gaussianity that runs (that is, has scale-dependence). 
Running non-Gaussianity may be a natural phenomenological question (and in fact even 
the slow-roll non-Gaussianity runs if the spectral index does), but it also has significant 
theoretical motivation. As we will review, the best-studied string theory model that gives 
rise to large non-Gaussianity has a natural scale-dependence. If the non-Gaussianity is 
large enough to be observable, its scale-dependence should also be investigated. 

Broadly speaking, constraints on primordial non-Gaussianity from observations of 
galaxies and clusters fall under two categories: observations of rare objects (such as galaxy 
clusters) which are sensitive to the tail of the probability distribution for density fluctua- 
tions, and measurements of higher-order clustering statistics of the density field. It has long 
been recognized that even small deviations from Gaussianity in the primordial fluctuation 
distribution would cause significant changes in the high mass tail of the halo distribution 
1 34, 35, 36, 37, [38], [J9], [5], 26]. Therefore observations of rare and/or high-redshift collapsed 



objects could place interesting constraints on non-Gaussianity, particularly for models in 
which the magnitude of the non-Gaussianity increases significantly on small scales. Obser- 
vations of galaxies and clusters of galaxies provide valuable information on scales different 
from CMB observations, as illustrated in Figure H. In addition, they are subject to a very 



3 The constraints on /jvl from WMAP are derived assuming a scale independent Jnl- Here we assume 
that the WMAP bound is valid for JNL^-max) where £ m ax — 475 is the maximum scale used in the analysis. 
If the data were reanalyzed permitting a scale-dependent non-Gaussianity it is likely that the constraints 
on /jvl would be relaxed. 



different set of systematic errors and observational constraints, strengthening their utility 
as a complement to CMB anisotropy observations. 

Higher-order clustering statistics, such as the galaxy bispectrum, depend upon the 



correlation statistics of the primordial curvature perturbations [ 41 , |^, |43|, [44], [4^, |4(| [47| . 
Unlike tests of non-Gaussianity from the abundance of collapsed objects, the galaxy bis- 
pectrum retains information about the shape-dependence of the primordial three-point 
function. As pointed out in [^7]] the bispectrum is potentially a powerful tool for distin- 
guishing between the local and equilateral models we discuss in §[3|. Measurements of the 
galaxy bispectrum can constrain /tvl on scales of tens of h~ 1 Mpc. Current constraints 
from large scale structure, assuming a scale-independent /jvx of the local type, are at the 
level \f NL \ ~ 2000 @ but future data set could achieve \Afft c L al \ ~ 10 and \Af^ il \ ~ 100 
|45| , f47| . In §|| we briefly discuss the effects of scale-dependent primordial non-Gaussianity 
on the evolved bispectrum. 

Before beginning a detailed discussion of observables, it is interesting to note that 
from an effective field theory point of view one does not expect to see Jnl more than 
a few without significant fine-tuning. For slow-roll models with standard kinetic terms, 
/jVX is proportional to slow-roll parameters. For example, the contribution from the third 
derivative of the potential goes like Xttj <C 10 -4 , where the small number comes from the 
amplitude of scalar fluctuations and the inequality comes from demanding a flat enough 
potential for sufficient inflation. This was shown in detail by Maldacena jl^fl . Terms with 
a shift symmetry in the inflaton field (j), as <p — > 4> + c (with c a constant) may be added 
without spoiling slow-roll, and in particular terms like (V (f) 2n j 'M 4n_4 , with n > 2 and M 
some mass scale, may be added. In order to truncate the added terms at some finite n 
without fine-tuning the coefficients, we need V</> << M 2 , and so as shown by Creminelli 



|4S] expect ftfl of order 1 at most. 



NL 

However, string theory is a natural place to look for a fundamental description of 
inflation, and, as a UV complete theory, contains exceptions to this expectation. While 
there are many string-inspired scalar fields that have been explored for cosmology, brane 
inflation |5(|] is an interesting example where a non-trivial kinetic term arises naturally. 
In this model the position of a brane (extended in our large dimensions and moving in 
the six compact extra dimensions) is the inflaton. The brane dynamics are given by the 
Dirac-Born-Infeld (DBI) action, where the kinetic term for the inflaton (j) is proportional 

to -y/l — (/> 2 r~ 1 (^>), and the quantity T is background dependent 4 . Here the square root 
is a natural higher-dimension extension of the relativistic action for a point particle. The 
square root gives a speed limit for the brane (<fi 2 < T((f))), and the closer the brane velocity 
is to that limit, the smaller the sound speed, c s , of the inflaton. The magnitude of non- 
Gaussianity goes as 1/c 2 , which is reasonable since a small sound speed means that many 
terms in the expansion of the square root are important. In the notation of ||49|| , as c s — > 0, 
V0/M 2 — > 1 from below. In a realistic scenario the speed limit is related to the background 
geometry of the extra dimensions and so can be quite small and 4> (time) dependent. The 



4 There are certainly corrections to this action in a variety of contexts, but the crucial feature of the 
appearance of an exact summation of powers of derivatives should remain. 
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Figure 1: The quantity \fNh{k)\ for several different values of the running of the non-Gaussianity 
ting ~ 1 = din fNL/dhik. The solid line has ting ~ 1 = 0, the dashed ting — 1 — 0-2, and the 
dot-dashed ting ~ 1 — 0.6. The shaded region in the upper left hand corner shows the range 
that is excluded at 95% confidence by current CMB data pTJ for equilateral shape non-Gaussianity 
(plotted is the more conservative lower bound on Jnl) ■ The shaded regions on the right shows 
the range of scales probed by the galaxy bispectrum and by clusters. The range of scales probed 
by the bispectrum depends (among other things) on the redshift of the survey, survey volume and 



the number density of galaxies, the above plot assumes V 
k is determined by the nonlinear scale p7J . 
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1 and the maximum 



sound speed may change considerably during inflation, leading to scale-dependent non- 
Gaussianity. Although some work is still needed to see if DBI inflation can be embedded 
in a consistent compactification (see, e.g. [pip, the appearance of the square-root kinetic 
term that gives the interesting features is quite suggestive of possible stringy signatures. Its 
importance is enhanced by the typical difficulty of finding flat potentials in string theory. 
Inflationary toy models with tachyons contain similar exact summations of a series of higher 



order operators [52]. 

From here, §p| reviews the relevant features of general single-field models, emphasizing 
scale-dependence and the consequent relations between observables. We use brane inflation 
to illustrate how scale-dependence may arise naturally in scenarios where the inflaton 
field sees a higher dimensional background. For readers wishing to avoid wading through 
equations, the key expression is Eq.(2.12) and its application in Eq.([3.6|), which gives our 



ansatz for the scale-dependence of the non-Gaussianity. In §|3| we compare the three-point 
functions for the local (/nl), general sound speed, and equilateral models. Next, we derive 



in §4.1 and §4.2 a non-Gaussian probability distribution and mass function and §4.3 presents 
current constraints and forecasts. In §[| we illustrate the effect of scale-dependence on the 
bispectrum. We summarize the utility of these observations and some issues relevant for 
future work in §|6|. Appendix [A] contains a clarification of previous conventions for /nl- 
Some issues in deriving a useful non-Gaussian probability distribution function (which 
are also relevant for simulations) can be found in Appendix [B|. The second half of that 
appendix compares our mass function with a suggestion found previously in the literature. 

2. General Single-field Inflation: Review and Example 

In this section we review general single-field models. For readers interested primarily in 
observational results, only the first subsection and Eq.( |2.12 ) are necessary to continue to 



§3. In §[2^ we discuss relationships between observables when the sound speed is small 
and changing, with the relationship between non-Gaussianity and the tensor-scalar ratio 
perhaps the most interesting. The last subsection introduces an example from string theory 
that is easy to visualize, and where a changing sound speed is natural. 

2.1 General Single Field Formalism 



Here we follow the notation of Garriga and Mukhanov [19|. The most general single-field 



model can be described by the action for the inflaton field <p, including gravity: 



i f d 4 x^-g [M 2 p R + 2P(X, <f>)} , (2.1) 



S= - 
2 

where X = — ^g^d^dycf) and R is the curvature calculated from the 4D metric g^ u . The 
function P is the pressure and the energy density E is given by 

E = 2XP lX -P , (2.2) 

where P,x is the derivative of the pressure with respect to X. We use the Friedmann- 
Robertson- Walker metric with scale factor a(t) 

dsl = -dt 2 + a 2 (t)5 ij dx i dx j . (2.3) 

The sound speed c s is given by 

2 _ P,X P,X 



E, x P, x +2XP, XX 



(2.4) 



The Friedmann equation relates the Hubble parameter {H = -) to the inflationary energy 
density as usual 

3M 2 H 2 = E . (2.5) 

Throughout we use the reduced Planck mass, M p = (SitGn)^/ 2 . To have the usual 
equation of state, we assume the potential energy is the dominant contribution to the total 



energy. Two combinations of derivatives besides the sound speed appear in the 3-point 
function: 



S = XP X + 2XP XX (2.6) 

2 3 
\xx + t;X P,xxx ■ 



A = X 2 P X x + lx 3 P- 



These models are not necessarily slow-roll in the sense that V,^ /V may not be small. 
However, there are suitable parameters analogous to the usual slow-roll quantities that are 
useful for expressing observables. These make use of the Hubble parameter H and its time 
derivatives instead of the potential (the Hamilton- Jacobi formalism). In the notation of 



|2g, m these are 



' = -4 (2 - 7) 

Inflation takes place for e < 1. When e, 77 and k are all small, to first order the scalar 
index and the tensor/scalar ratio are given by 

n B -lta-2e-r)-K (2.8) 

r = 16ec s . 

The power spectrum of the primordial curvature perturbation is given by 

<C(ki)C(k 2 )> = (2vr) 3 5 D (k 1 + k 2 ) (2 ^3 (fcl) (2.9) 

where V^(k) oc k ns ~ l is the dimensionless variance and 5d is the Dirac delta function. 

A detailed calculation of the bispectrum for models with arbitrary sound speed can 
be found in [21]. The non-Gaussianity is enhanced by l/c^, and new terms are generically 
present that are not suppressed by the small parameters e and r\. These terms vanish 
when c s = 1 (equivalently P,xx= 0). Notice that in the slow-roll case, self-couplings of 
the inflaton that appear in the potential must be small to preserve slow-roll and the small 
amplitude of fluctuations. For small sound speed models, higher-order derivative terms 
do not interfere with slow-roll and lead to terms in the 3-point function which are not 
suppressed by e, 77. 

For the general action in Eq.( |2.1| ), the leading order contributions to the three point 



function of the primordial curvature may be written [21] 



(C(k 1 )C(k 2 )C(k 3 )) = (2^) 7 Mkx + k 2 + kO^^Sr (^ + Ac + 0{e/c 2 s ) + 0(e)) (2.10) 

K 1 K 2 K3 



where K 



i + k 2 + k 3 c 


md 


■M?- 


-l-|[2-(3- 


A = (3 ■ 


-).(-*§ 



^ 2^ ** *J + ^2 Z-> k i k j + $l^ k i 



where ci « 0.577 is the Euler constant, and / = X/(HX) is of the same order as the 
slow-roll parameters 5 . There are several terms suppressed by slow-roll parameters hidden 
in the 0{e/(? s ) and 0(e) terms in Eq.( [2,10 ). These should also be evaluated to give a 



precise answer for any model. However, here we are concerned with an investigation of the 
magnitude of scale-dependence that can be constrained by experiment so we ignore these 
terms. Notice that the dimensionless power spectrum V^(K), evaluated at K = k\ + k 2 + k% 
has been pulled out. This is a choice (a similar one is often made in the /jvx formalism), 
but the full /c-dependence of Eq.(2.10) is of course fixed and can be written independently 



of the power spectrum. 

The sign of the dominant piece depends on details of the Lagrangian. In particular 
the first line above, A\, is n °t suppressed by a slow-roll parameter and can a priori have 
either sign. On the other hand since c s < 1 the second term, A c , is negative. For the DBI 
model presented in the next section, 2A/S = \j(? s — 1 so the largest terms in A\ vanish, A c 
dominates and the skewness is negative. The consequences that we show for the A c term 
(which we call the "A c shape") are roughly those of the DBI model. 

2.2 Scale-Dependent Relationships 

Scale dependence in the sound speed means that there is correlated scale dependence among 
the observables. In order for Eq.([2.§|) to be valid, we assume the parameter k is a good 
expansion parameter (|k| < 1). The sound speed may vary more quickly, but then the 
analysis of even the two-point function must be revisited (see discussion in pi]]). When 
the sound speed c s does not change too rapidly, its scale dependence can be characterized 
in a manner similar to that of power spectrum |65| . That is, 



_ d\n(c s 2 ) 

Notice that for k < the amount of non-Gaussianity increases with scale. For models 
where 4> = —2MpH'c s we have 

n NG - 1 2k . (2.13) 

We saw in Eq.(|2.S|) that the usual slow-roll consistency equation for the tensor/scalar 
ratio is modified even for a constant sound speed c s 7^ 1. 

r = 16ec s . (2.14) 



Notice that terms are not grouped completely by slow-roll order, so that some terms in A\ are of the 
same order as terms we have not explicitly written out. 



This expression demonstrates an interesting relationship between measurable values of r 
and measurable non-Gaussianity. Clearly, a very small sound speed will depress the ratio. 
However, from effective field theory with a constant sound speed, we do not expect to find 
observable tensor modes anyway. This expectation is commonly expressed as the Lyth 
bound [53] 

1 d(j> fr 

w p dN e = y8- (2 - 15) 

Which, for r nearly constant, says that requiring A(fi/M p < 1 and about 60 e-folds suggests 
r less than a percent. The CMB plus SDSS results constrain r < 0.28 at k = 0.002Mpc~ 1 
p|, which is far above this expectation. If the sound speed changes during inflation then r 
is no longer constant and it may be possible to find a somewhat larger tensor /scalar ratio 
at CMB scales. We would then also expect a non-Gaussian signal which is smaller on the 
largest scales and increasing at smaller scales. 

2.3 An Example from String Theory 

Here we introduce brane inflation as a model for generating and constraining large non- 
Gaussianity. We review a few pertinent details and discuss why it is such an informative 
context in which to consider smaller scale limits on non-Gaussianity. We introduce it purely 
as a case in point, demonstrating that small and changing sound speed arises naturally in 
string theory. 

From a model-building perspective, brane inflation has several attractive features. 
First, it sits in the well-explored arena of IIB flux compactifications [p4|, |55|| and has been 



extensively compared with observations [pq, 57, 58, p9|. More importantly, it can poten- 



tially exhibit stringy signatures, like substantial non-Gaussianity. Many models of inflation 
in string theory have inflaton potentials that are too steep to naturally generate 55-60 e- 
folds. In brane inflation the potential is also expected to be rather steep |5^, |6(], ^lj, but the 
square-root kinetic term allows enough inflation by forcing the brane to "slow-roll" in the 
sense of small <fi p2|, |63|] . The field then spends long enough rolling down to generate enough 
inflation. Many of the interesting observational features (including non-Gaussianity) are 
linked to the "speed limit" feature that is so useful for obtaining inflation. However, it 
is not yet clear if there is an honest way to embed this model in a fully consistent com- 
pactification. Experiment may resolve the issue faster than theory, as many parts of the 
parameter space can be ruled out in the near future as constraints on the tensor/scalar 
ratio and non-Gaussianity are improved. 

The original brane inflation model used the attractive force between a D3-brane and 
an anti-brane (D3) as the inflaton potential, with the brane separation as the inflaton. This 
simple idea can be made more realistic by placing the branes in a compactified description 
of the extra dimensions. We will not worry about the details of such a procedure beyond 
assuming a reasonable local background geometry that gives rise to scale-dependent non- 
Gaussianity. It is a property of that background (the warping) that causes the sound speed 
to change during inflation. 

In brane inflation the canonical inflaton </> is related to the brane position. The basic 
action for a D-brane is the Dirac-Born-Infeld (DBI) action together with a potential V{4>) 



that is in principle known. Writing this action in a simplified form gives 



<fx a 6 (t) 



TWy/l-ft/TW+VW 



(2.16) 



where a(t) is the usual scale factor and T(cp) is a non-trivial function of <ft that comes 
from the background six-dimensional geometry the brane moves in. The importance of 
the square root for the inflationary properties has led to the name 'DBI model' for brane 
inflation in a (^-dependent background geometry. The effects of the square root are usefully 
captured by a Lorentz factor 7: 

7(0) - - I (2.17) 



which again demonstrates that the speed limit for <f> depends on the function T(<p). A 
functional form that arises naturally is T(</>) ~ </> 4 with corrections so that T{4>) goes to a 
constant as cj) goes to |34|]. Then 7 may grow quite large at small cf). In the case of the 
above square root action, there is a simple relationship between 7 and the sound speed: 

c s ,dbi = - (2.18) 

7 

For a brane moving toward small in a geometry that gives T(<p) ~ (ft 4 , the speed limit 
(cp 2 < T((f))) becomes more constraining, so 7 grows during inflation and the non-Gaussian 
signature (which is oc 1/c^ — 1) increases. The ^-dependence of the non-Gaussianity be- 
comes scale-dependence in the observed primordial gravitational field as the fluctuations 
are frozen in at horizon crossing, c s k = aH. The parameter k captures the change in sound 
speed, and in this paper we will treat k as a constant (although in the full solution it is 
not, and changes in k can distinguish features of the geometry). In the limit of large 7, 
K — ► — 2e and r/ — ► 0, so n s — ► 1. In a variation of this scenario, the brane may start in the 



bottom of the throat and move out relativistically 65, 66, 67, |68|. In this case the speed 
limit constraint decreases as inflation progresses, so the non-Gaussianity would be largest 
at larger scales. In other words, the scale dependence of the non-Gaussianity provides 
information about the geometry of the extra dimensions. 

3. Sizes, Shapes and Scalings 

So far we have discussed scale-dependence of non-Gaussianity which can arise in models 
of inflation with varying speed of sound. An equally important feature is the configura- 
tion dependence of the higher-order correlation functions. In this section we review and 
discuss the value of two categories of configuration dependence or shape of the primordial 
bispectrum and how these may be adapted to allow for scale dependence. 

3.1 The Importance of Shape 



As discussed in [30 1 , two qualities distinguish the primordial bispectrum from sound speed 



models from that of local models. The first is shape, or the difference in the dependence 



10 



of the three-point functions on the relative magnitudes of ki, k 2 and k 3 . The second is 
the scale dependence of the sound speed c s (k) which gives rise to a scale-dependent non- 
Gaussianity which we will address in the next section. 

In the literature the non-Gaussianity is often assumed to be of the "local" type defined 
through Eq. fll.ip . The amplitude of (£q) is expected to be small so that the expansion 
is a good approximation so long as |/jvx| << (Cg) 2 - Measurable quantities such as 
correlation functions of the non-Gaussian field £ can be worked out from Eq.( |l.l[ ), 



(C(ki)C(k 2 )) = (C G (k 1 )Cc(k 2 )) + OUlal)) (3-1) 

L*o htO Lo 

' t l' t 2 fi '3 



(C(k 1 )C(k 2 )C(k 3 )) = (2n) 7 5 D (k 1 + k 2 + k 3 ) 1J ^A local (k 1 ,k 2 ,k 3 ) + 0(f NL (C 2 G 



where 



A local (h,k 2 ,k 3 ) = ^L K -2(ns-l) (fi(k 2 h) n '- 1 + k%(k 1 k 3 ) n '- 1 +k$(k 1 h 2 ) n '- 1 ) . (3.2) 

For general non-Gaussianity the 3-point function will not have the above form. In 
particular if we compare Eq. Q3.lD and Eq.( |2.10| ) we see that the k dependence is quite 
different, this is illustrated in panels (a) and (b) of Figure @. One can nevertheless define 
an ftf£ by taking the equilateral triangle limit (ki = k 2 = k 3 ) of Eq. Q2.lCQ and Eq. flOl) , 
and identifying the coefficients 7 : 



*eff _ 

JNL — 


fNL + fNL + 0(l/cl)+0(< 


Inl = 


i^'Gr 1 -! 12 - 


Inl = 


-m^-H^- 1 )- 



(3 - 2ci)Z] 

(3.3) 



Notice that Eq.(Ll) assumes a constant /jvx, while the models described in §|| generally 
have c s (k). The /^-dependence causes additional difficulty with assuming the local param- 
eterization for non-Gaussianity. 

One could imagine using the local form of the non-Gaussianity with the above def- 
initions of fftl to estimate the non-Gaussianity from sound speed models, but this ap- 
proximation is not very good. This fact is illustrated in panel (b) of Figure |2|, where the 
fractional difference between A c (the dominant contribution for DBI inflation) and the lo- 
cal form Ai oca i using the f^j[ value in Eq.(^) is plotted as a function of k 2 /k\ and k 3 /k\. 
The fractional difference is > 1 for much of the allowed k 2 jk\-k 3 jk\ range. 

3.2 A Scale-Dependent Equilateral Model 

Creminelli et al. p8| , B7J have proposed a functional form which closely approximates the 
behavior of the three-point function when a single higher derivative term, (V(/>) 4 /(8M 4 ), 



6 Here and throughout this paper we assume the variance, V^ is described by a power law, that is n s — 1 
is independent of k. Equations allowing for a general V^ would be more complicated. 

7 Note that our definitions of Jnl for sounds speed models differ from those in CHKS pi] , see Appendix 
N for a discussion of this issue. 
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Figure 2: (a) The shape of the primordial bispectrum for the local model, 
■Aiocai{X,k2,kz) / {k2ks) / Jnl- The domain of the plot is restricted to ki + k 2 + k 3 = 0. (b) 
Contour plot of the fractional difference between the local form of non-Gaussianity and the 
DBI shape. Shaded regions show contours of (beginning from the upper left-hand corner) 
{■Aiocai — Ac) /A c =0, 0.05, 0.1, 0.5, 1, 2, 10. (c) The dominant shape in the primordial bispectrum 
for the DBI model, plotted is -4 C (1, fe, k^) / (k2k^) / f^ L .(d) Contour plot of the fractional difference 
between the equilateral form of non-Gaussianity and the DBI shape. Shaded regions show contours 
of (beginning from the upper left-hand corner) (A egu u ~ A c )/A c — 0, 0.01, 0.02, 0.05, 0.1, 0.25. 



has been included in the Lagrangian and gravity has been ignored. This shape is also useful 
for efficient data analysis. This so-called equilateral shape has a three-point function with 
the form 



9 



Ae.uu.i^MM) = ^f$ L K- 2{ns - l) {-k\{k 2 k^- ] 



+ k{ 



+ V eim.-2(k 1 k 2 k 3 ) 1+2 ^- 1 ^ 3 

2+(n»-l)/3 , l+2(n s -l)/3 , n ,-l 



n ■) 



Ao 



+ perm. 



(3.4) 



Unlike the local shape, the maximum signal in A equ u(ki, k 2 , ^/(kfk^k^) occurs for config- 
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urations with k\ ~ k^ ~ &3- Recall that in the local shape the maximum contributions to 
the skewness occur for "squeezed" configurations k\,hi » k%. Current (95% confidence 



level) constraints on the equilateral shape from the CMB are [27] 



-256 < f$ L < 332 . (3.5) 

The parameterization of Eq.(|l.l|) and previous work on the equilateral shape assume a 
scale independent /jvl; with the possibility of scale-dependent non-Gaussianity the bounds 
on an /jvx-type parameter must be determined at various scales. Unlike the local model, 
there is no requirement that ft? L for the equilateral model be constant 8 . As discussed in 



§2.2, inflationary models with changing sound speed predict that the non-Gaussianity will 
run with scale. We can extend any model, including the equilateral model, to allow for 
scale-dependent non-Gaussianity by setting 

fNL W = fNL, CMB ( J. J • ( 3 - 6 ) 

where |«| <C 1 is a free parameter. This choice is motivated by Eq. (|2.13| ), but we only 
consider the simplest choice that k is constant, at least between CMB and cluster scales. 
The choice of pivot scale given as kcMB m Eq.(|3.6j) is of course arbitrary. In this paper 
we are primarily interested in constraining non-Gaussianity that increases at small scales 
(large k) so we choose the pivot scale to be just below the minimum scale that has been 
used to constrain non-Gaussianity in the CMB, kcMB = 0.04 Mpc . In this case n < 0. 
For non-Gaussianity that decreases on small scales, k > 0. 

In Figure ^ the shapes of Ai oca i and A equ u are compared with the shape of A c . The 
equilateral model is clearly a superior approximation to the true A c shape. However the 
two shapes still differ by a non-negligible amount so we will continue to show predictions 
for both the equilateral and the "DBI-like" A c shape. Notice from Eq. (|3.3|) that the DBI 
model gives a negatively skewed distribution for the density fluctuations. In the case of 
the DBI model (keeping only the c term in Eq.( |2.1l| ) and ignoring the difference between 
the the equilateral model and the c term) Eq.( |3.5[ ) gives a constraint on the sound speed 
at kcMB 

c s ,DBi(kcMB) > 0.034 . (3.7) 

4. Tests of Non-Gaussianity on Sub-CMB Scales: Cluster Number Counts 

If non-Gaussianity increases at small scales, then it could evade CMB constraints but still 
leave an observational imprint on large-scale structures. There are at least two ways of 
testing non-Gaussianity at small scales: through higher-order correlations of large-scale 
structure probes (e.g. galaxy bispectrum and the three-point correlation function) and, 
reaching even smaller scales, the abundance of collapsed objects. The first approach relies 



3 Of course, one can imagine a configuration that is maximal for squeezed configurations but is not 



the local type as given in Eq.(l.l). Then there can certainly be scale-dependence. This is precisely the 
situation with the terms from standard slow-roll: they are maximum in the squeezed limit and proportional 
to parameters that may change during inflation. 
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on knowledge of higher-order correlations of the primordial curvature perturbation which 
are then evolved by means of cosmological perturbation theory (for a review see [|6Ef|). 
The second approach requires knowledge of the probability distribution from which the 
smoothed density fluctuation is drawn. This probability distribution is most sensitive 
to the skewness of the primordial curvature perturbation (assuming reasonable ordering 
of the cumulants) but in principle requires knowledge of an infinite number higher-order 
correlation functions. The evolved bispectrum retains the full information about the shape 
of the primordial bispectrum which is lost in the smoothed skewness used for predicting 
the abundance of collapsed objects. Both the galaxy bispectrum and the abundance of 
collapsed objects are sensitive to the scale-dependence of the primordial bispectrum. We 
focus on cluster number counts first since these probe smaller scales, the bispectrum is 
considered in §|5[ 

While in the derivations we will try to be as general as possible we make a few assump- 
tions about the cosmology. In what follows we assume a flat AC DM universe. In plots 
and calculations that require input of cosmological parameters we assume the WMAP III 
maximum likelihood values Q: a Hubble parameter today of Hq = WOh with h = 0.73, 
&"m = 0.24, Q\ = 0.76 and Q^h 2 = 0.0223 as the fractional densities in matter, vacuum and 
baryons today, a scalar spectral index n s = 0.958, and that our power spectrum is normal- 



ized to cr 8 = 0.77 (corresponding to P c (fe) « 2.22 x 10~ 9 ( nn ^ _ x ) ). We will use the 



k 0.002Mpc 

transfer function of |7(J with a modified shape parameter T = f2 m /iexp[— 0&(1-|- y2h/Q m )] 
to calculate the linear power spectrum for matter perturbations [fnfl. 

4.1 The Non-Gaussian Probability Distribution Function for the Smoothed 
Density Fluctuation 

We are interested in predictions for rare objects, that is the collapsed objects that form in 
extreme peaks of the density field <5(x) = 5p/p. The statistics of collapsed objects can be 
described by the statistics of the density perturbation smoothed on some length scale R 
(or equivalently a mass scale M = n^R 3 p)- This is given by 



W= I 7^W R (k)5(k,z) (4.1) 



d 3 k 

where Wn{k) is the Fourier transform of a window function, which we take to be a top-hat 

in real space giving 

TTr .,, 3sin(kR) 3cos(kR) 

The relation between the primordial curvature perturbation £ and the linear perturbation 
to the matter density 5 = 5p/p today is 

5{k,z) = Af(fc,z)C(k) 
M(k,z) = l-^-^D(z)T(k)k 2 (4.3) 

where D{z) is the linear growth function, z is the redshift, and T(k) is the transfer function 
which describes the suppression in amplitude of the modes that entered the horizon during 
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the radiation dominated era. The linear matter power spectrum is then given by Pi(k) = 
2n 2 M{k,z) 2 V { ^{k)/k i . 

To incorporate non-Gaussian initial conditions into predictions for the smoothed den- 
sity field we need an expression for the probability distribution function (PDF) for 8 R . For 
a particular real-space expansion like Eq. ( |1 . 1[ ) , one may make a formal change of variable 
in the Gaussian PDF to generate a normalized distribution |4C]. However, as we discuss 
below this expansion is not particularly well-matched to the DBI scenario, and it is not 
obvious what expression analogous to Eq,(|l.l|) captures the full fc-dependence of realistic 
models. However, there is a straightforward mathematical relationship which allows one 
to build up the PDF from the cumulants and a known distribution like the Gaussian. This 
technique leads to the well-known Edgeworth expansion and is especially useful for quan- 
tities that depend on the central part of the PDF. We will demonstrate that it is also valid 
in a regime that allows us to probe clusters, at least for small enough masses and redshifts. 

For a probability density function P(5n)d5R, the n-th central moment is 



(S. 



R) 



5 n R P(5 R )d5 R 



(4.4) 



The n-th cumulant K n is the connected n-point function, so that 



«l 

«2 
«3 



(Sr) 

(4) 



(Sr) 2 c 
(5r)-Z(Sr)c(Sr)c 



(S 



Rlc 

& \2 



S(S 2 r)(Sr) 2 



ml 



(4.5) 



(4.6) 



and the reduced cumulants are defined as 



p-1 



One may define a generating function for the S p by 

s(y) = J2s P (RV— l ^y p 



p=2 



pi 



(4.7) 



(4.8) 



Then an exact expression for the PDF in terms of the cumulants is given by 



P(S R )dS R 



d$R 1 r° ,,.^\ySR S(y) 



®R J —too 



dyexp 



°r a R 



where we have defined a R = K2- Using the saddle point approximation in Eq 
collecting terms of the same order, one arrives at the Edgeworth expansion: 



P{v)dv 



dv 



-I/ 2 /2 



l + <TR^H 3 (v) + a R l Si(R) 



6 



24 



H&) + ^H 6 (u, 



(4.9) 
Q and 

+ ... 
(4.10) 
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where v = Sr/ctr and the H n are Hermite polynomials 

H 3 (u) = v 3 - 3u (4.11) 

H 4 (u) = u 4 -6u 2 + 3 

H & {v) = v 6 - 15u 4 + A5u 2 - 15. 

If only the first few terms in the expansion are kept the recovered probability distribution 
will be approximate and have a limited range of validity. For example, the PDF can develop 
negative regions if only the first S3 term is kept. To use the Edgeworth expansion for given 
values of 5r,ctr, S*3(-R) ... we will therefore have to check that we remain in a region where 
the PDF is well behaved. This is discussed in more detail in Appendix 0. 
The variance of the smoothed density fluctuation is 



W? J ^f W ^ W ^ k ') M ^ ^ M ( fc '' *)<C0OCOO> 

'-W R (k) 2 M(k,z) 2 V c (k) (4.12) 



i) k 



where we have used Eq.(^l^) to relate 5 and £. The smoothed variance for a (real space) 
top hat window function is shown in Figure |3[ The smoothed skewness is calculated from 
the three-point function 

/r^k f r/^k C d 3 \n 
(2^/(2^7 (^ W i^^3M 1 M 2 M 3 (C(k 1 )C(k 2 )C(k3)) 

l-^r /^VF 1 iy 2 ^ 12 M 1 M 2 M 12 (2vr) 4 (^ W) 2rl%^) (4.13) 



(2tt) 3 J (2tt) 3 l z iZ l z lzy ' v y " kfklk 



where /ci 2 = \Jk\ -\-k\-\- 2ki • k 2 and K = k\ + A; 2 + k% . We have collapsed the 5- function 
in Eq,( |2.ltiD setting k3 = ki + k 2 to get the second line, though in practice it will be 
computationally easier to use the <5-function to collapse different integrals depending on 
the form of the integrand (since A is a sum of terms) . 

The smoothed skewness for the local, equilateral and A c models is shown in Figure 
||[ As expected, the smoothed skewness for the local model is quite different in both 
shape and amplitude from the equilateral and A c models. In Figure ||] we also show the 
fractional error between the equilateral and A c forms of the smoothed three-point function, 
[(5 3 (R)) — (S 3 q (R))) /(5 3 q (R)}. On the scales we are interested in the assumption of the 
equilateral form leads to an error of roughly 20% for several different values of k. Since the 
error is slowly varying across the range of scales clusters probe, one can imagine scaling 
constraints on f^? L to infer constraints on f^ L - 

We will use Eq.( 4.10| ) together with the smoothed cumulants for the various models to 



predict the effect of scale-dependent non-Gaussianity on cluster counts. 

4.2 The Non-Gaussian Mass Function 

The halo mass function gives the number of collapsed structures (halos) at a given red- 
shift, per unit volume with mass within a given mass interval (between M and M+dM). 
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Figure 3: (a) The smoothed variance, (b) The smoothed skewness for the local, equilateral and 
DBI type models all with fffl = —256 and k — 0. (c) The fractional difference between the 
smoothed skewness for the DBI-type model and that for the scale-dependent equilateral model 
with several values of the parameter k. (d) The scale-dependence of the non-Gaussianity is visible 
in S 3 a = (5 3 )/(6 2 f /2 for the DBI model (just the A c term). 



Expressions for the halo mass function in the presence of non-Gaussian initial conditions 



have been derived as extensions of the Press-Schechter formula [72, 36, ^, 4C, 74, 75]. 
Our derivation of the non-Gaussian mass function follows that of |36|, |73| but here we use 
the Edgeworth expansion (Eq. (f4.10|) keeping terms up to order a S3) for our non-Gaussian 



probability distribution. The approach is similar to that of Matarrese, Verde and Jimenez 
(MVJ) ||40|| . However, we take a different approximation in order to have a better ana- 
lytic understanding of the range of validity of the expansion, this is discussed further in 



Appendix |B. 4 . 

In the spirit of Press-Schechter we assert that the fraction of a volume V that has col- 
lapsed into objects of mass M is proportional to the probability that the density fluctuation 



17 



smoothed on scale M is above the threshold for collapse 5 C . This implies that 5 



f c = 2P(>S c ,M) = 2 dSP(8,M). 



(4.14) 



The fraction of volume V that has collapsed into objects with masses between M and 
M + dM is \f c (M + dM) - f c (M)\. This is proportional to the mass of the collapsed 
objects with masses between M and M + dM in V divided by the total mass in V. 



\f c (M + dM) - f c (M)\ 



[n(M + dM) - n(M)\ MV 

w 



(4.15) 



Where dn(M)dM is the number density of halos with masses in the range (M, M + dM). 
Together with Eq.( 4.14| ) this gives an expression for the mass function dn(M)jdM in terms 
of the probability distribution 



dn , , . s 
dM K ' ; 



, P d 

'MdM 



5c/v(M) 



dvP{v, M) 



(4.16) 



where p = £l m Pcrit is the average (comoving) matter density. The redshift dependence is 
carried by the threshold for collapse 5 c (z) ~ 1.686 D(z = 0)/D(z) where D(z) is the linear 
growth function. 

For a Gaussian probability distribution we recover the original Press-Schechter mass 
function 



dn . „ . s 



2 p 5 c (z) dlnaM 



exp[-5 2 c (z)/(2ai I )] 



(4.17) 



7rM 2 a M dlnM 

For the non-Gaussian case, and for the skewness not too large, one can use the Edge- 
worth expansion for the PDF Eq.( [4.10|) . Performing the integral in Eq.( 4.16 ) gives 10 



P(>5 c \z c ,M) 



1 



1 — erf 



\/2<7jU 



S 3 (M)a M 
3! 




Then the mass function is given by 
dn(M, z) 



dM 



(4.18) 



(4.19) 



2 -^e 

IT M 

ldS 3 

+ 6dM aM 



2CT M 



dhxaM 

dM 



ctm 6 



a 



M 



2^1 
M 



a 



a 



M 



9 For a general probability distribution the "fudge factor" of 2 may need to be modified. Pq, uM determine 
this factor by requiring p — J °° M-j^dM. For the cases we consider, this condition may be reasonably 
imposed as long as k — > at some small scale. 

10 If the probability distribution P{u)dv is independent of mass then this step can be skipped and the 
mass function is just given by -j^ = —2p/M-^ ^m" P (fie /cm) recovering the expression introduced in 
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For £3 and all higher reduced cumulants equal to zero, this reduces to the usual expression 
for a Gaussian distribution. What is the effect of including skewness S3 in the mass 
function? The exponential is unchanged with respect to the Gaussian case so we need 
only concern ourselves with the terms in the square brackets. For the models we consider 
dS^/dM has the same sign as £3 on cluster scales. So if S3 > then at the low mass end 
where 5 c /ctm < 1 the number of objects is decreased. At high masses 8 c /(Tm grows rapidly 
so S3 > increases the number of very massive halos. 

Comparison with numerical simulations (with Gaussian initial conditions) have shown 
that the Press-Schechter form of the mass function over-predicts the abundance of low mass 
objects and under-predicts that of high-mass objects. The discrepancy is not surprising 



as e.g. the spherical collapse assumption made to arrive at Eg. ( 4. 17 ) may not hold in 



detail |[7q| . Sheth and Tormen [77] suggested a formula that is in much better agreement 



with simulations, and further improvements were suggested by [78, 7£[. Since these results 
have not be generalized to allow for generic non-Gaussian initial conditions, we will use a 
Gaussian mass function in better agreement with simulations and use the Press-Schechter- 
derived mass function to model departures from non-Gaussianity: 

*£rW>*) = S(M,,) d ^ ( ^ M ' Z) . (4.20) 

dM v ' dM y ; ^(5 3 = ,M,z) V ' 

In the above driQJdM denotes the preferred Gaussian mass function and dnps/dM(S3, M, z) 
denotes the non-Gaussian extension of the Press-Schechter mass function Eq. (|4.19| ) , which 
reduces to the original Press-Schechter expression for S${M) = 0. We discuss recent N- 
body simulations that have investigated the validity of this approach in the conclusions. 

In our calculations we take the Sheth and Tormen mass function as dnc/dM in 
Eq.( [4,2dp . The Sheth- Tormen mass function, obtained from Gaussian initial conditions 
and calibrated on numerical simulations, is given by |77| 

dn S T [2a A / n , ( a5%\~ p \ p S c diner _ aS 2 /{2a 2 ) 



dM 




e~ a0 l^ u > (4.21) 



where a = 0.707, A = 0.322184, and p = 0.3. 

With the mass function in hand, the number of clusters per redshift interval above 
some mass threshold M m i n can be calculated, 



dV(z) f'°° dn 

■(M > M Hm ) = f sky —y- / dM—(M, z) (4.22) 

dz JM mm dM 



where f s k y is the fraction of the sky being observed and the volume element is given by 



dV 4vr 



dz H{z) 



dz' 



H(z') 



(4.23) 



4.3 Predictions for Cluster Number Counts 



Galaxy clusters are the largest virialized cosmological structures in the universe, and there- 
fore provide a unique way to explore non-Gaussianity on scales much smaller than those 
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accessible to CMB anisotropy measurements. In addition, probes of non-Gaussianity us- 
ing cluster number counts are not subject to the same set of systematic effects as CMB 
measurements are, so this method provides a valuable complement to the CMB. Upcoming 
large mass-limited surveys of galaxy clusters with SPT |80|| , ACT [81], and Planck [32| will 



yield data sets particularly well suited to the joint analysis of cluster number counts with 
CMB measurements to probe scale-dependent primordial non-Gaussianity. 

Given a particular form for the primordial curvature bispectrum, Equations ( 4.13J ) 



and fl4,19| ) can be used to calculate the non-Gaussian mass function. We consider the A c 
term (roughly the DBI model) and the scale-dependent equilateral model (Eq. fl3.4p and 
Eq.fl3"H|)). For the A c term we take the sound speed to be determined by Eq. (|3.7| ), for the 
equilateral model we take fNL(kcMB) = —256 and 332 (the extrema of the current 2-a 
bounds from WMAP). For both A c and the equilateral model we consider several values 
of k (see Eq.( |3.6|) ). Figure || illustrates the effect of primordial non-Gaussianity on the 
cluster mass function and Figure |B| shows the effect on cluster number counts with mass 
threshold of M > M Hm = 1.75 x 10 14 /t _1 M sun , roughly as predicted for ACT and SPT. As 
expected, the largest changes to the mass function are at high mass and high redshift. One 
can also see that the equilateral model slightly under-predicts the effect of non-Gaussianity 
from small sound speed models (note that the origin of the equilateral model is a case 
that should be contained in the A c term). Allowing for scale-dependent non-Gaussianity 
can dramatically increase the allowed effect of primordial skewness on the cluster mass 
function while remaining consistent with CMB observations. Figure [| illustrates that even 
for values of f^ L (kcMB) that are within the CMB constraints the amplitude of dN/dz is 
changed by as much as 40%. This suggests that upcoming mass-limited cluster survey data 
can indeed be used in conjunction with the CMB to constrain scale-dependent primordial 
non-Gaussianity. 

Cluster number counts are also quite sensitive to other cosmological parameters and 
systematic errors, so to understand the utility of cluster number counts as a constraint 
on scale-dependent non-Gaussianity we will need to examine degeneracies. In particular, 
the dominant effect of primordial non-Gaussianity is to change the expected number of 
clusters, however under or overestimating the mass threshold Mn m or the value of a% also 
produces significant changes to the amplitude of dN/dz. In the bottom panels of Figure || 
we show the effect of changing the value of as and Mi im for a Gaussian model. The change 
in amplitude of dN/dz caused by a 5% shift in as is significantly larger than the effect of 
primordial non-Gaussianity for all but the most strongly scale-dependent models of ftll(k). 
On the other hand the redshift dependence of a non-Gaussian dN/dz is quite different than 
a Gaussian dN/dz with uncertain as or Mn m so this can be used to disentangle the two 
effects. 

4.4 Forecasted Constraints 

We will use the Fisher matrix method to address the issue of degeneracies between cosmo- 
logical parameters and primordial non-Gaussianity and forecast the sensitivities of future 
galaxy cluster surveys to simultaneously constrain these parameters. We account for sys- 
tematic errors by allowing for an uncertainty in the mass threshold Mn m . While the Fisher 
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Figure 4: The ratio of the non-Gaussian mass function to the Gaussian mass function for DBI 
inflation (green curves, just the c term) with a sound speed that saturates the bound on non- 
Gaussianity at the CMB scales, and for the equilateral shape of the bispectrum with f^ L (kcMB) = 
332 (magenta upper curves) and f^ L {kcMB) = —256 (blue lower curves, showing smaller deviation 
from Gaussian than the c-term). The solid horizontal line is the Gaussian prediction, the dotted 
curves have no running of the non-Gaussianity (k = 0), the dashed curves have non-Gaussianity that 
increases on small scales n — —0.1 and the dot-dashed curves have k = —0.3. The shaded regions 
show the regimes in which corrections to the non-Gaussian mass function from the (S^ct) 2 term 
reach 5% - that is, in the shaded regions the validity of truncating the Edgeworth expansion at the 
first term is uncertain (see Appendix B). The validity of the expansion depends on the magnitude 
of the skewness: the left hand boundary is where the mass function for DBI-type with k = —0.1 
becomes invalid, the right cross-hatched region is where the mass function for f^ L {k) = 332 and 
k = breaks down. All other curves become invalid somewhere between the two boundaries, 
except the k = —0.3 cases, which becomes invalid at lower mass but where the deviation from 
Gaussianity is larger than shown in the range of the plot. For example, for the equilateral model 
with fat = —256, k = —0.3, and z = 0, the expansion is valid for M < 1.55 x lQ 15 h~ 1 M sun \ at 
z = 0.9 the same curve is unreliable above M = 2.25 x 10 14 /i _1 M stm . 



matrix approach is only an approximation (it makes the simplifying assumption of a Gaus- 
sian likelihood) it is nevertheless a useful first tool for producing forecasts and exploring 
parameter degeneracies. 

The observation of a discrete number of clusters is a Poisson process. The probability 
of observing m clusters in the i th experimental bin (bins in redshift, mass or both, for 
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Figure 5: (a) The number of clusters with M > Mu m = 1.75 x 10 14 /i _1 M sun per redshift interval 
per deg. 2 shown for fff L (kcMB) — +332 (magenta), /^(fecMs) = —256 (blue) and for DBI-type 
inflation (green curves, just the c-term) with a sound speed that saturates the CMB bounds. The 
solid curve is the Gaussian prediction, the dotted curves have k = 0, the dashed curve has n = —0.1 
and the dot-dashed have n — —0.3. (b) The ratio of the quantities in (a) to the Gaussian dN/dz. 
Figures (c) and (d) reproduce (a) and (b) but also show the change in dN/dz for a Gaussian 
cosmology if the mass threshold Mu m or erg were changed. Clearly precise knowledge of these 
parameters is necessary to use cluster number counts to constrain primordial non-Gaussianity. The 
hatched and cross-hatched regions are the same as in the previous figure. 
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example) is given by 

f> n i 

P i = ^-exp(-e i ) (4.24) 

mi 

where m is the observed number of clusters in the i experimental bin and ej is the expected 
number in that bin. For a set of N^ na uncorrelated experimental bins the probability of 
observing n\ . . . riN bina clusters in each bin is then 

p = U p = U h exp{ ~ ei) (425) 

»=i »=i *■ 

The expected number of clusters in each bin a is a function of the model parameters (e.g. 
ra , /jvi, . . . ). The maximum likelihood estimates for these parameters are those values 
that maximize the probability distribution P. We identify the likelihood with —2 times 
the logarithm of the probability distribution of the "data", i.e. the observed number of 
clusters in each bin. — 21nP will behave as a \ 2 distribution with the number of degrees 



of freedom equal to the number of parameters [83, 84, 85] . The Fisher information matrix 
is then given by 

F- __(£!£>- 3* 1*1*. (4.20) 

OPaOPb f-' ei dp a dp b 

where the p a are the model parameters we use the cluster counts to constrain. 

We consider the constraints on f^ L (kcMB), o"8i ^m> h, and k. We use a single mass 
bin (all clusters have M > Mn m = 1.75 x 10 hM aun ), redshift bins of width Az = 
0.2 and integrate dN/dz across each bin. Decreasing the bin width and increasing the 
total number of redshift bins does not significantly change our results. For the fiducial 
cosmology we assume equilateral type bispectrum with the WMAP maximum likelihood 
value f^ L (kcMB) = 38 p7|. We consider k = 0.0 and k = —0.3 with all other cosmological 
parameters as given in §0L At high redshift corrections to our mass function become 
important (see Appendix pj) so for the models with k = we limit our analysis to bins 
with mean redshift z~i < 1.3, while for the models with k = —0.3 we use Z{ < 0.9. We 
consider two sets of priors on Q m , h, erg and f^ L - The WMAP forecast assumes an 11% 
prior on TO , 4% on h, 6% on as and a prior on f^ L of ±150 roughly reflecting the current 
state of knowledge from WMAP. The second forecast assumes Planck-like expectations of 



3.5% prior on J7 m , 1.5% on h, 2% on as and a prior on ft? L of ±75 [|^, 27]. A major 



' NL 

source of uncertainty for cluster surveys is uncertainty in the mass threshold Mn m . We 
put a 10% prior on Mu m and all quoted constraints marginalize over Mn m . The forecasted 
constraints are listed in Table [l], for comparison we also list the constraints for fiducial 
models with f NL (k C MB) = -256 and f NL (k C MB) = ±332 (the WMAP III 95% confidence 
bounds) and all other parameters unchanged. 

A few features are apparent: while the constraints on Q m and as are improved beyond 
the prior information, the constraints on h and /^(fccMfl) are not improved by adding 
cluster information. What is gained by the inclusion of cluster counts is a constraint 
on the parameter quantifying the running of the non-Gaussianity, k. Models with larger 
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0.0068 
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Table 1: The forecasted 1-cr errors on f2 m , eg, f^ L {kcMB) and re for three equilateral type non- 
Gaussian fiducial models with Q m = 0.24, h = 0.73 and as — 0.77. The errors are quoted for a 
cluster survey with one mass bin M > Mn m = 1.75 x 10 14 h~ 1 M sun and full sky coverage. To 
determine the errors for a survey with partial sky coverage multiply the quoted error by 1/^/fsky 
We use redshift bins of width Az = 0.2, for the models with re — 0.0 we use 7 bins up to z max — 1.3. 
For those with re = —0.3 we use 5 bins up to z max = 0.9. For each value of re, the values of z max are 
chosen to stay within the regime where the mass function is valid for all three /^(/ccmb) values 
(see Appendix |b|). Note that ACT, SPT, and Planck will provide different constraints as both the 
sky coverage and the depth of the survey will vary - ACT will yield the smallest area, deepest 
survey of the three, and Planck will produce a full-sky survey with a higher mass limit than the 
other two. 



fiducial f^ L (kcMB) or re allow for better constraints on re while for the model with smaller 
Inl^cmb) and re = clusters provide weaker constraints on re. 

Another point is that the constraints on the cosmological parameters O m , h, and ag 
depend on the magnitude of f^ L - This not too surprising because the constraints for each 
parameter p a depend on the magnitude of dei/dp a and on ej for each fiducial model. The 



derivative of mass function, Eq. ( 4.19 ), in the presence of non-Gaussian initial conditions 



will have a sum of terms, some of which are dependent upon 53. Since S3 oc f^ L (kcMB), 
changing f^ L changes the sign and relative magnitude of these terms and therefore the 
magnitude of dei/dp a . On the other hand f^ L (kcMB) affects the number of clusters ej and 
the error on p a is inversely proportional to y/ei- 

A summary of our findings can be seen in Figure 0. Our Fisher analysis shows that 
if \f < x L (kcMB)\ is large (just within the current WMAP bounds) cluster number counts 
will allow one to constrain the running of the non-Gaussianity re. If the running of the 
non-Gaussianity is also large (re = —0.3) then it is likely to be detected by a complete 
cluster survey. On the other hand, if f^ L (kcMB) ~ 38 and re is large, cluster number 
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Figure 6: The joint constraints on f^ L {kcMB) and the running of the non-Gaussianity k for a 
full-sky cluster survey marginalized over fl m , h, ag and Mu m . Shown are 1-er contours for WMAP 
(blue outer curves) and Planck (red inner curves) priors. Note that the plot range varies from panel 
to panel. Recall, f^ L ex k~ 2fi so the degeneracy line changes when the sign of f^ L is changed. 
The fact that k is a slow roll parameter restricts its value to n << 1, the dotted lines show the 
constraints if a Gaussian prior of ±0.5 is put on k to enforce this. 



counts may provide evidence of the running of non-Gaussianity but are unlikely to yield 
strong constraints. Constraints may be improved by grouping the clusters into multiple 
mass bins, retaining some of the information about how dn/dM depends differently on 
each parameter. However, uncertainty in cluster masses and the rapid fall-off of the mass 
function at high masses cause the mass bins to be strongly correlated with one another. 
Finally we note that if we hadn't imposed the prior information on /jvl from the CMB, 
clusters could at best achieve joint constraints of afNL ~ 1000 and Okappa ~ 1 (for WMAP 
priors, with the same survey specifications listed above). If no running of /jvx is allowed 
(k = 0) this improves to be crf NL ~ 100 — 500 depending on the sign and magnitude of 
Jnl- 

5. Tests of Non-Gaussianity on Sub-CMB Scales: The Evolved Bispec- 
trum 



Many planned large scale structure surveys could be well suited to test the primordial 
non-Gaussianity through measurements of the bispectrum. A partial list of surveys that 
may measure the galaxy bispectrum is SDSS §7|], HETDEX |§, BOSS ^, PAU @, 
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ADEPT |1|, LSST g|, WFMOS gg] and SPACE @. A full analysis of the potential 
of large-scale surveys to constrain the scale-dependence of the primordial bispectrum is 
beyond the scope of this paper. However, allowing scale-dependence has an interesting 
feature which we would like to point out. The bispectrum of the density field 

<5(ki)5(k a )<5(ka)) = (2vr) 3 5 D (k 1 + k 2 + k 3 )B(k u k 2 , h) (5.1) 

can be computed from perturbation theory, to leading order this is 

B(h,k 2 , h) = B I (ki,k 2 , h) + B G (k u k 2 , k 3 ) (5.2) 

where B G is the gravitationally induced bispectrum and B, the initial bispectrum, is given 
by 

BtfaMM,*) = M(k 1 ,z)M(k 2 ,z)M(k 3 ,z)B^(k 1 ,k 2 ,k 3 ) 

= (27r) 4 M(fci, z)M(k 2 , z)M(k 3 , z) \> A(h, k 2 , k 3 ) . (5.3) 

K 1 K 2 K 3 



For a review of the calculation leading to Eq.(5.2) and expressions for B (ki,k 2 ,k 3 ) see 

& 

A useful quantity is the reduced bispectrum given by 
n(h h , n B(h,k 2 ,k 3 ,z) 

CA*1,*2,*3,*J P L (h,z)P L (k 2 ,z)+P L (k 1 ,z)P L (k 3 ,z)+P L (k 2 ,z)P L (k 3 ,z) [bA) 

« k > k > k >* = hmB (5 - 5) 

4 , ZfJl CMB {k/k C MB)- 2K 

= 7 + 5M(M) ( } 



where Ph{k, z) is the linear power spectrum given in Eq. Q2.9| ). The first term in Eq. (f>-(\), 
is the gravitational contribution, 11 the second is from the primordial bispectrum. For small 
k (small compared with the horizon scale at matter radiation equality k eq ) M ~ k 2 , while 
for k » k eq , M ~ ln(k). Thus for a constant /jy/, the k dependence of Q 1 ' (k) is fixed, in 
particular \Q (k)\ is a decreasing function. This is in contrast to the scale-dependent case 
where the dominant term has an additional dependence on f^^(k). Recall for a sound 
speed model the dominant terms have f^fl(k) ~ (l/c 2 (k) — 1) where c s (k) ~ k K . If, for 
example, \fpJl (k)\ is an increasing function of k, \Q c (k)\ can be an increasing function of k. 
This point is illustrated in the left panel of Figure [|. For models with n < the primordial 
contribution to the bispectrum turns over and begins to increase at some k. Of course, the 
turnover scale may be at such large k that corrections due to non-linear evolution become 
important and this calculation will need to be modified. 

In contrast to cluster number counts, the bispectrum retains the shape-dependence of 
the primordial bispectrum, potentially allowing one to discriminate between the local and 



11 For a universe with Q m ^ 1, there are corrections of order Q, m ' — 1, we restrict our discussion to 
high redshift (z = 1) where these corrections are ~ 1%. The exact form is calculable, see for instance pty. 
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Figure 7: (a) The reduced bispectrum including both the gravitationally induced non-Gaussianity 
and the evolved primordial non-Gaussianity from a scale-dependent model with /nl = (solid 
black line), /jvi(fe) = 332(k/kcMB)~ 2K (magenta upper curves) and fNb{k) — — 256(fc/fccA/B)~ 2K 
(black lower curves) shown here at Z = 1. The dotted line has k = 0, the dashed K = —0.1 and 
the dot-dashed has n — —0.3. The behavior of the dominant term in DBI with a sound speed 
saturating CMB bounds is identical to the fNh{k) = —256(k/kcMB)~ 2K - In the equilateral limit 
shown above the local and equilateral models agree, (b) The reduced bispectrum as a function of 
(j) where fei • &2 = cos <j>. The solid black curve has no primordial contribution. Upper magenta lines 
have (constant) Jnl = 332, dashed is equilateral shape, dotted is the local shape. The green solid 
curve is the dominant contribution for DBI (just the c-term) with k = 0. Lower blue lines have 
(constant) /nl — —256, dashed is equilateral shape, dotted is the local shape. 



equilateral shapes (e.g. panel (b) of Figure and also f47]]). Additionally, the reduced 
bispectrum is independent of as, which for cluster number counts is degenerate with /nl- 
On the other hand, galaxy surveys measure the perturbation to the galaxy number density 
5 ga i rather than the matter density perturbation 5. One often assumes that the two are 
related by 

5„„i(x) = 6 1( 5(x)+6 2 5 2 (x). (5.7) 



'Jgal 



In [47 1 it was shown that for the equilateral shape bispectrum, there is a degeneracy 
between ff9 L and the bias parameters b\ and 62- Cluster number counts have no dependence 
on these parameters, so they may be useful in breaking this degeneracy. (There is no 
serious degeneracy between f^L and the dark energy equation of state, according to [f4~7|],) 
Perhaps the most serious drawback of cluster number counts, that they are sensitive only 
to the amplitude of S3 and not to the shape of the bispectrum, can be turned into an 
advantage. That is, cluster number counts will be sensitive to any shape (and scaling) of 
non-Gaussianity that leads to a non-zero S3 at cluster scales, knowledge of the particular 
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shape of the bispectrum is less important. Finally, we mention that the systematics involved 
in counting collapsed objects are certainly different than those arising in measurements of 
correlation functions. A multi-pronged approach should be taken. 

6. Conclusions 

Primordial non-Gaussianity is an excellent discriminator of inflationary models. In partic- 
ular, detection of large primordial non-Gaussianity (|/jy/| greater than a few) would be a 
signature of inflationary physics outside of the smooth, slow-roll paradigm |l3|, [lij]. Many 
models that give equally viable predictions for the power spectrum can be distinguished by 
their non-Gaussianity, which may be an important tool for understanding the fundamental 
origin of inflation. Currently there is no reason to expect scale-invariant non-Gaussianity 
(even the slow-roll contribution is scale-dependent, although quite small in magnitude), 
and a number of well-motivated inflationary models predict a non-Gaussianity that can be 
large and run with scale [2C, pif| . The possibility of scale-dependent non-Gaussianity gives 



new importance to observational probes at multiple scales. From a very optimistic point 
of view, observably large non-Gaussianity might be a signature of stringy physics, and in 
that case its running may probe the geometry of the extra dimensions. 

In this paper, we have motivated scale-dependent non-Gaussianity which arises in 
string-inspired inflationary models such as DBI inflation (§ |2.3| ). In the models we discuss 
there are also well-defined relations between the "running" of the non-Gaussianity and 
other observables such as the tensor to scalar ratio (§ p.2[ ). We proposed a simple power law 
description of scale-dependent non-Gaussianity in Eq.( |3.6[ ) valid for general non-Gaussian 
shapes. 

While the CMB constrains deviations from Gaussianity at the largest observable scales, 
the abundance of collapsed objects is sensitive to the non-Gaussianity of the primordial 
density field at small scales. Allowing for scale-dependent non-Gaussianity means that 
models could be consistent with CMB constraints but still leave an observable imprint on 
other scales. For this reason we have used a new expression for the non-Gaussian halo mass 
function: Eq.( [4.19| ) including terms proportional to the skewness, and Eq.( B.5| ) expanded 



to higher-order cumulants. The mass function Eq.( 4.19| ) compares well with other proposed 



non-Gaussian mass functions and offers an analytic understanding of the range of validity 
of the approximations made in the derivation of the expression (see Appendix |B.4| ). 

We have examined the utility of cluster number counts (relevant at a scale an order of 
magnitude smaller than the current smallest CMB scale) to constrain the magnitude, sign, 
and scale-dependence of primordial non-Gaussianity. A modest running of the skewness 
towards small scales leads to an f e J[ at cluster scales that is a few times larger in magnitude 
than frfl at CMB scales. Thus, despite the fact that clusters typically provide weaker 
constraints on (constant) /jvx when compared with the CMB, they can nevertheless provide 
important bounds on scale-dependent non-Gaussianity. For example, non-Gaussianity that 
is within the CMB bounds can easily cause a 10 — 40% change in the number of clusters 
(Figure |5|). However, with upcoming experiments tight constraints on the parameter k 
(quantifying the running of the non-Gaussianity) are likely to be found only if the amplitude 
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of the primordial contribution is just within the current CMB constraints, and/or the 
running of the non-Gaussianity is large (Table || and Figure ||) . 

The galaxy bispectrum can be used to constrain non-Gaussianity on scales between 
those probed by the CMB and cluster number counts. Measurements of the bispectrum 
are sensitive to the magnitude of the non-Gaussiainity and the running parameter, and 
also distinguish between the various shapes (local and equilateral, for example). Figure ^ 
illustrates these effects. 

The three types of measurements we have made use of here (cluster number counts, 
the galaxy bispectrum, and the CMB) are all subject to different systematics and different 
degeneracies. For example, while uncertainties in the mass-observable relation or the value 
of as are particularly troublesome for cluster surveys, the reduced galaxy bispectrum is 
completely independent of these parameters. On the other hand, a potentially problematic 
degeneracy exists between galaxy bias parameters and f^[ , but cluster number counts are 
not dependent on galaxy bias. Detection of primordial non-Gaussianity will likely require 
confirmation from multiple probes. Ideally, measurements across a range of scales should 
be combined. In addition, the current WMAP data, which by itself probes a large range of 
scales, should be reanalyzed to look for non-Gaussianity of a more general, scale-dependent 
type. It is likely that the bound we have imposed on the magnitude of ftfl at the smallest 
CMB scale may be relaxed. 

Many of the issues in developing the probability distribution for the initial fluctuations, 
which limit the utility of our analysis, would be helped by further N-body simulations 
for more general non-Gaussian initial conditions. Three recent papers [^, J96|, ^| have 
examined the validity of the extended Press-Schechter approach (which we have adopted 
here) for non-Gaussianity of the local type. Their conclusions differ, and clearly more work 
should be done to determine the accuracy of the mass function we have used for models 
that are not at all like the local model. A related issue is that we have included only the 
skewness in our calculations. Including higher-order cumulants may allow one to extend 
the regime of validity of the mass function to higher masses and redshifts. 

Finally, we mention that although we have focused on the ability of near-future obser- 
vations to probe scale-dependence in the primordial non-Gaussianity, one might eventually 
hope to do much better. For example, 21 cm observations might give an enormous number 
of data points at higher redshift |9q , M. The smallest scale constraint may come from 
primordial black holes, formed by large fluctuations that re-entered very early in the ra- 



diation dominated era. Originally used as a constraint on the spectral index [100, 101 ], 
the requirement that primordial black holes that survive to the present day do not over- 
close the universe may also constrain the running of the non-Gaussianity. The relevant 
scale is roughly k ~ 10 15 Mpc _1 , many orders of magnitude beyond the range where the 
constraints we have investigated here apply. However, this is also far beyond the range 
of validity of the Edgeworth expansion {y is large) and the contributions of higher-order 
cumulants cannot necessarily be neglected. We leave a full discussion of the conditions 
under which primordial black holes constrain the running of the non-Gaussianity for future 
work 110 



Acknowledgments We thank Greg Bryan, Lam Hui, Jan Kratochvil, Louis Leblond, 



29 



Sabino Matarrese and Sheng Wang for useful discussions. We are especially grateful to 
the organizers and participants of the "Life Beyond the Gaussian" workshop where we 
presented an early version of this work. A. M. gratefully acknowledges the generous support 
of the Alfred P. Sloan Foundation. The work of S. S. and M. L. is supported by the DOE 
under DE-FG02-92ER40699. M. L. is also supported by the Initiatives in Science and 
Engineering Program at Columbia University. L. V. is supported by FP7-PEOPLE-2007- 
4-3-IRG n 202182. 

A. Review of First Order Perturbations 

In this appendix we review the calculation relating perturbations in the inflaton 8<j) to 
primordial curvature perturbations using gauge invariant variables, and then relate the 
primordial curvature perturbation to perturbations in the energy density. We begin by 
reviewing some of the gauge invariant variables discussed in Bardeen (1980) (for a more 



pedagogical introduction see for instance, 103|). We will consider scalar perturbations to 



the metric only, and neglect higher-order terms that appear in gauge transformations and 
when relating quantities using Einstein's equation. 

Begin with the following parameterization for scalar perturbations to the flat Friedmann- 
Robertson- Walker metric 

5 oo = -(1 + 24) g w = g i0 = -adiB 9ij = a 2 (t) [(1 + 2^)5^ - 2d { d 3 E\ (A.l) 

Bardeen's gauge invariant scalars are then 

B r . -I 

a(B - aE) 

® H = if; - aH 



* A = A -ai 



B-aE 



(A.2) 



where • is the derivative with respect to t and H = a/a. We can also write down the gauge 
invariant velocity perturbation 

vf J - =d i [B-aE)+ J (A.3) 

V / a(p + p) 

where p and p are the background energy density and pressure and 5T a i is a perturbation 
to the energy- momentum tensor. For completeness let us also give the gauge invariant 
energy density perturbation which is most conveniently written in Fourier space 

^..fe+W**}. (A4) 

p k z p 

where k l is the comoving wave number. Since the sum of two gauge invariant variables is 



also gauge invariant, Eq. flA.2j ) and Eq.(A.3) can be used to define a new gauge invariant 
variable 

C GJ - = * H + aH=^- (A.5) 

(A.6) 
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The expression for Q G - L can be simplified by using the first order Einstein equations, for a 
universe filled with a perfect fluid with equation of state p = wp we have 

C G.I. = ^ + ^$ + 2 H-^h. (A.7) 

s S + Sw 3(1 +w) v ; 

It can be shown that on large scales in an era where w is constant <&h = so the above 
reduces to 

C GJ - = ^-^^ A.8 

3 + 3w 

where w is the equation of state of the dominant energy component in the universe (e.g. 
w = 1/3 during radiation domination and w = during matter domination). 

A.l Relating 5(f) to ( and £ to <5 

The ADM formalism (which is reviewed nicely in ||104|] ) is useful for obtaining the relation 
between perturbations in the inflaton 5(f) and the gauge invariant scalar ( GJ \ Here we 
follow the notation of [14| writing the metric with scalar perturbations as 



ds 2 = -(1 + Nxfdt 2 + a 2 (l + 2()5 ij (dx i + N i dt)(dx j + N j dt) 
Ni = a 2 5ijN j = a 2 d^ . (A.9) 

The gauge has not yet been fully specified, but we can fix it by requiring that ( = 
(spatially flat gauge) or by choosing slices with T° { ex 5(f) = (comoving gauge). Once the 
gauge is fixed the constraint equations can be used to eliminate Ni and ip in favor of C, or 
5(f). Calculations in the two gauges can be related by using gauge invariant variables. With 
this parameterization of the metric 

>r , ,. Hik^T - , k N 

« =( -Tmrry <A - 10) 

We can see that in the comoving gauge where 5T° i = 0, C ' = £. While in the spatially 
flat gauge where £ = and <5T°j = —<fidi5(j), ( i G - 1 - = —H5(f/(f>. Here we need to relate 
C to &h during the matter dominated era, so from Eq.( |A.8[ ), <3?# = 3/5£. In conformal 
Newtonian gauge during the matter dominated era the Bardeen curvature $h satisfies 
Poisson's equation 

k 2 <S> H = AiiGa 2 5p. (A.ll) 

where we are using the Fourier convention 

5 ^ )= Swf e ^ m - (A - 12) 

Now, define the linear growth factor by <£# (k, z) = Z?(z)(l+z)<3?#(k). Putting this together 
with Eq.( |A.8j ) gives the relation between 5 and C m the matter dominated era as 

<5(k, z) = l-L-LD(z)T(k)k 2 ak). (A.13) 

i lyxi ±1 q 
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A. 2 /jvx Conventions 

In the literature there are several conventions for the definition of the non-Gaussian pa- 



rameter Jnl- In this paper, we use the convention of |24| who define Jnl in terms of the 
Bardeen curvature $>h, 

$H = $H,G + fNL($H,G-($H,G)) ( A - 14 ) 

where the extra subscript G is to indicate that the field is a Gaussian random field. Phys- 
ically, if /nl is defined as above in terms of the curvature, then a positive Jnl leads 
to negative skewness in the temperature field and positive skewness in the density field 
(corresponding to more rare objects). 

In a gauge where B — aE = (for example in the often-used conformal Newtonian 
gauge), &a = — &H 12 - So if one instead used the above expansion in terms of the gravi- 
tational potential $a then a minus sign would need to be introduced to compare with the 
WMAP /nl- To be completely explicit, suppose we defined Jnl by 

*A = $A,G + fNL(^A,G ~ (*i,G» ( A -15) 



then comparing with Eq.( [A14 ) we see Jnl = —Jnl 



This sign change is indeed necessary to compare Jml here with /tvl as given by Mal- 
dacena [14| and CHKS [21|. Maldacena defines fff L by 



c = a - g/j&cS (A.i6) 

The 3/5 appears because during the matter dominated era ( = 5/3<3?# = — 5/3&A- In 

summary, 

rhere _ , fWMAP _ i-M ( \ 17\ 

jnl — ~i-Jnl — ~jnl y.-^- 1 ') 

Our definitions of /jvl for sound speed models actually differ from those in CHKS 
1 21] in two ways, first we have allowed for tilted spectrum n s ^ 1 giving rise to the 
factors of 3™ 3-1 . Second, we differ by a minus sign because we have defined /nl here to 
agree with the WMAP convention - i.e. /nl here, Eq.( |A.14 ), is defined in terms of the 



Bardeen curvature perturbation <&h, while positive /nl in CHKS is defined in terms of the 
Newtonian potential $^- 

The above discussion is consistent with a special case that can be understood physically. 
In slow roll, in the "squeezed triangle" limit where k\ <C k2,k%, the non-Gaussianity is 
proportional to the tilt of the scalar spectral index n s — 1. To see this, consider a smooth 
potential with d^V < and relate the Hubble parameter and the variation of the inflaton 
4> by: 

-^A(f> = HAt (A.18) 

where t is time and the dot denotes a time derivative. Since H > 0, <fi > 0, a fluctuation 
Acp < (the field jumps up the potential) means At < 0. This is the usual way of thinking 



2 This statement is only true if anisotropic stress is unimportant. 
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of fluctuations of the inflaton field meaning that inflation ends at slightly different times 
in different patches. A fluctuation A(j) < means that inflation ends a little later in that 
patch and so the patch is a hotter, higher density region. In the spatially flat gauge, where 
the curvature perturbation £ (which appears in the metric) is zero, the gauge invariant 
curvature is related to the fluctuations of the inflaton field by (^ G - L = —H5(f>/(j) (consistent 
with Maldacena's Eq.(2.26)). 

For the squeezed limit, mode k\ is frozen out much earlier than &2, k%, so a fluctuation 
C > only causes &2, ^3 to exit the horizon a little earlier (At = —(,/H, exiting farther 
from the end of inflation in that patch, which occurs at the same value of <ft as in all 
other patches). If the spectrum is blue-tilted, this gives negative skewness since there will 
be less power in the fluctuations in that patch, and so less power in the largest density 
fluctuations, than if the power spectrum was flat. In his Eq.(4.9), Maldacena finds —fj^i ~ 
— (n s — 1). Then a blue tilt corresponds to f^f L > and negative skewness. We are using 
the opposite sign convention from Maldacena. In DBI, then, the largest effect will give 
negative skewness. In other models where the first term in Eq.( |3.3| ) dominates, the largest 
effect could give positive skewness. 

B. The Probability Distribution Function 

B.l Comparing the Various Distributions 

The Jml expansion is useful for examining the effect of keeping only a few terms in Edge- 
worth expansion 13 . It is also useful since one can obtain the PDF by making a formal 
change of variable in the Gaussian distribution. Doing so gives 

P(y)dv = dV *(£_ + S+) (B.l) 

V 1 + 4/jvi<r(f + f N L(r) V2vr 



£± = exp 



1 



,, ,,., \ 1 + 2f NL o{v + fNL<j) =F \A + ^Jnl<?{v + Inlct) 

4<T JNL 



Note that the natural combination here is |/jvx<t|. For |/tvlc| < 1/10, this distribution is 
smooth and resembles the Gaussian. At larger values the shape develops a feature on the 
suppressed tail. It also has an imaginary part on one tail for /jyx < 0. This formal change 
of variable is useful for comparing various truncations of the Edgeworth expansion using 
unsmoothed cumulants. 

B.2 Validity of the Edgeworth Expansion 



The Edgeworth series, Eq.( 4.lti| ) is an exact expression for the PDF in terms of its cumu- 



lants, but in practice one often calculates just the first few cumulants. How valid is the 
PDF if we use, say, only the skewness? There are several issues here. First, assuming we 
keep only the first and second terms in the series Eq. (|4.gD (up to linear in S3), the steepest 



s One can make a similar comparison with other expansions, such as the Gamma expansion developed 



in [105]. The Gamma expansion is most useful for the case of positive skewness, so we have not developed 



it here. 
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descent calculation is only valid for v < l/\Ssa\. In addition, the final distribution will 
not be positive definite unless higher-order cumulants are included. We truncate the Edge- 
worth expansion at linear order in S3, since S4 comes in at roughly the same order as Sf. 
Panel (a) of Figure || shows a comparison of a negative region for the change of variable 
distribution Eq.(B.l), the Edgeworth expansion, Eq.( 4.10| ), including up to the S^a term 
and the Edgeworth expansion including the a 2 term. The Gaussian is also plotted for 
reference. With negative skewness, the distribution has a negative region on the v > 
side. 

If 6/a < 1 the Edgeworth expansion will be a good approximation to the true PDF so 
long as 

1 »S 3 a»S 4 a 2 »S 5 a 3 ... (B.2) 



However for collapsed objects such as clusters 6/a > 1. In this case the leading term in 
each Hermite polynomial (see Eq.( f4.10] )) is H n ~ (5/a) n . Then truncating the expansion 
requires something like 



S 3 a ( — ) << 1 and S n » S n+ i<j 



for n > 3 



(B.3) 



Now, the number of terms that should be included in the Edgeworth expansion depends 
on the value of 6/a. For clusters, 6/a ~ (few) to 6/a — » 00, we would in principle need 
an infinite number of the cumulants for a valid expansion. This is a point of concern, but 
since we will be interested in the PDF integrated above some threshold and the PDF drops 
off rapidly for 6/a > 1 the situation isn't as bad is it may appear. We expect then, to 
find an expression for the PDF that has a limited range of validity. The range will depend 
on mass, since a is a decreasing function of M, and on redshift since the threshold for a 
density fluctuation to collapse, 6 C , is larger at high redshift. 

Below, panel (b) of Figure |8| shows the Edgeworth expansion results for the DBI case 
(A c term) with f^L = —256 and k = 0. The growth of the negative region at larger scales 
corresponds with expectations from panel (d) of Figure |3| 

B.3 Validity of the Non-Gaussian Mass Function 



The derivation of the mass function in §4.2 involves integrating the PDF above the threshold 
for collapse 6 C . Using the Edgeworth expansion and keeping terms up to second order one 
finds 



P(>6 C ,M) = -erfc 
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Figure 8: (a) A region of the probability distribution that is negative if the Edgeworth series is 
truncated at S3. The distributions are plotted for unsmoothed quantities, with variance a = 1 
and Jnl = —0.03. Notice that in this case, including the kurtosis cures the negative region, (b) 
Examination of the negative region of the PDF, coming from truncation of the Edgeworth expansion 
for Jnl = —256 and k = 0. Results for several values of the smoothing scale are shown. 



From this a mass function with terms up to second order (compare with Eq. fl4.19| ) which 
is to first order) can be obtained 
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Without knowledge of all higher cumulants we are forced to use an approximate form 
for the mass function. Since the skewness is the dominant contributer, we keep terms up 
to 530". We assume that when the {S30) 2 terms become important, it is no longer valid to 
neglect the S40 2 term. Similarly, we are assuming that the S^a 2 terms become important 
no sooner than the (S^a) 2 terms. Figure ^ compares the mass functions Eq.( 4.19| ) and 
Eq.( |B.5[) when one and two orders of S^a are kept. We define the regime of validity for 
our truncated mass function (Eq. (|4.19| )) to be where corrections from including the (S30") 2 
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Figure 9: Comparison of the different mass functions for different non-Gaussian models, shown 
are f^ L = —256 (black), 332 (red) and 38 (cyan). (a) Dotted lines: Comparison of the first or- 



der in S3CT mass function Eq.(4.1£) and the second order in S^cr mass function Eq.(B.5), plot- 



ted is (Eq. ([4.19|)- Eq.( |B.5| ))/Eq.( |B.5D with 5*4 = 0. Dashed lines: Comparison of the MVJ mass 
function Eq. flB.q) and the second order in S^er mass function Eq.( B.5 ), plotted is (Eq.( B.6 )- 
Eq. (|B.5|) )/Eq. (|B.5| ) with S4, — 0. (b) Dotted lines: Comparison of the first order in £317 mass 



function Eq.(4.19) and the numerically integrated all-orders in S^a mass function Eq.(B.7), plotted 
is (Eq.(|4.19|)-Eq.(B~7|))/Eq.(|B~^). Dashed lines: Comparison of the MVJ mass function, Eg. (BO) 



and the all-orders in S 3 a mass function Eq.(B.7), plotted is (Eq.(|B.6|)-Eq.(B.7))/Eq.(B.7) 



in Eq.( [B.5P reach 5%. One can see that the range of mass scales where the mass function 
Eq.( |4.19| ) is valid decreases both with redshift and as Jnl increases in magnitude. 

B.4 Comparison with MVJ 



In §4.2 we presented a mass function using the Edgeworth expansion for the non-Gaussian 
probability distribution. Another form for the non-Gaussian mass function, based only on 
the skewness was given in Matarrese, Verde and Jimenez 14 (hereafter MVJ) [|40|. 
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where <5* = 8 c yjl — S 3 5 c /3. Neither Eq.( |4.19| ) nor Eq.( |B.6| ) are exact expressions for the 
number density of massive objects. There are different ways to test these approximations. 
Perhaps the most reliable method is to compare with N-body simulations. Figure 3 of 
|96fl shows that for values of f l § c ^ 1 ~ 100 and for reasonable values of M (at least the 
range contained in Figure ph and z = the MVJ approximation is in good agreement with 



'Here we correct a typo in Eq. (68) of MVJ, this expression is in agreement with Eq. (6) of [E6[ 
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simulations. However a direct comparison of mass functions from different simulations [95 



96, 97 1 should be done before drawing definitive conclusions. A more detailed comparison 
with simulations (including for example the equilateral shape non-Gaussianity) is left to 
future work. 

Short of running simulations, one way to compare the approximations is, as discussed 
in the previous section, to consider corrections from including the (S^a) 2 . In Figure |9| we 
plot the fractional difference between the mass function of MVJ, Eq.( |B.6[ ) and Eq.( |B.5| ). 
While for f^L = 332 Eq.( [4.19 ) is a better approximation to the second order mass func- 

256 the mass function of MVJ is a better 
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tion Eq.QBJ) than MVJ, for f NL 
approximation . 

Another comparison one may make is to assume the (possibly unphysical) case of non- 
Gaussianity that produces only S3, with all higher cumulants vanishing. In this case, an 
expression for the mass function can be determined from Eq.([4.S|) and Eg. ( [4.16] ) where 
S(y) in Eq. Q4.9J ) is truncated at 53. Exchanging the orders of integration (d5 <-> dy) one 
arrives at MVJ Eq. (62) from which we can define a mass function that keeps all orders in 
S 3 a 



dn 

&1V1 all orders 



(M,z) 



d 



■k M dM 



dX 



Av 



12 sin ( X5 c {z) + 



S s (M)ai [ X 3 



(B.7) 



where A = iyja 2 M . The above expression must be numerically integrated. A comparison of 
the all-orders mass function with the first order and MVJ mass functions is shown in panel 
(b) of Figure ||. For /jvz, = 38 and —256 the mass function of MVJ more closely matches 
the all-orders mass function. While for /jvl = 332 Eq.( [4.19 ) is in better agreement. 

In this paper we have used the mass function derived from the Edgeworth expansion 
because we suspect that higher cumulants (e.g. 54) may become important at high mass 
and/or high redshift. With the Edgeworth-derived mass function we have a better analytic 
understanding the range of masses and redshifts where we expect the mass function to be 
valid when truncated at 53. 
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